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In the Claims 

33. (Previously presented) A system for automatically cataloguing documents located in multiple 
heterogeneous repositories, the system comprising: 

a scanr : ng tool for scanning the multiple heterogeneous repositories to collect keywords 
for the documents located therein; 

a keyword index to the documents built using the collected keywords; 

a mapping tool for mapping the documents using the keyword index to one or more 
classes, each of the one or more classes including keywords representative of that class; and 

a computing device for creating metadata indicative of each of the documents and 
cataloguing ea :;h of the documents in an integrated library according to the metadata in a meta- 
index, whereit the metadata for each of the documents indexed within the meta-index is stored in 
a pre-defined < ata structure including at least one of the following attributes a uniform resource 
locator (URL) a title, an author, an abstract, a collection, a keyword, one or more matched 
words, a path, a classmark, a classification date and a last modified date. 

34. (Previously presented) The system according to claim 33, wherein the meta-index retains 
characteristics Df each of the multiple heterogeneous repositories as applied to each of the 
documents sue li that a user may access one or more of the documents within the multiple 
heterogeneous repositories utilizing the meta-index. 

35. (Previously presented) The system according to claim 34, wherein the characteristics of the 
multiple heterogeneous repositories are transparent to the user when one or more of the 
documents are accessed using the meta-index. 
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36. (Previously presented) The system according to claim 33, wherein the metadata is stored in 
extensible Markup Language (XML) format. 

37. (Previously presented) The system according to claim 33, wherein the metadata is stored in 
Resource Description Framework (RDF) format. 

38. (Previously presented) The system according to claim 33, wherein the scanning tool is at 
least one spide r. 

39. (Previous!;' presented) The system according to claim 33, wherein the mapping tool is a 
domain ontoloijy. 

40. (Previously presented) The system according to claim 39, wherein the domain ontology is a 
classification hierarchy. 

41. (Previously presented) The system according to claim 33, wherein the mapping tool is a 
neural networl:. 

42. (Previous!; ,• presented) A method for automatically cataloguing documents located in 
multiple heterogeneous repositories, comprising: 

scanning the multiple heterogeneous repositories to collect keywords from the documents 
located thereir ; 
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buildin ;; a keyword index to the documents stored in the multiple heterogeneous 

repositories us: :ig the collected keywords; 

mappin g the documents using the keyword index into predetermined classes, wherein the 
mapping is pei formed using at least one mapping tool; 

creatirii i; metadata information, including identification of the predetermined class, for the 
documents; an I 

catalog uing each of the documents in an integrated library according to the metadata in a 
meta-index, w lerein the metadata for each of the documents indexed within the meta-index is 
stored in a pre defined data structure including at least one of the following attributes a universal 
resource locator, a title, an author, an abstract, a collection, a keyword, one or more matched 
words, a path, a classmark, a classification date and a last modified date and further wherein the 
meta-index reliiiins the characteristics of each of the multiple heterogeneous repositories as 
applied to each of the documents such that a user may access one or more of the documents 
within the mu.tiple heterogeneous repositories utilizing the meta-index. 

43. (Previous! «/ presented) The method of claim 42, wherein scanning the al least one 
information n .Tository to collect keywords is performed by a spider. 

44. (Previously presented) The method of claim 42, wherein the metadata information is stored 
in the extensible Markup Language (XML) format. 

45. (Previous;;/ presented) The method of claim 42, wherein the metadata information is stored 
in the Resoun :e Description Framework (RDF) format. 
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46. (Previously presented) A method for automatically cataloguing documents located on at least 
a first and seccud website, comprising: 

scannic .g the at least a first and second website to collect keywords from the documents 
located therein , wherein documents located on a first website are in a first format and documents 
located on a se ;rond website are in a second format; 

buildin g a keyword index to the documents stored on the at least a first and second 
website using 1 "ie collected keywords; 

mappk 3 the documents using the keyword index into predetermined classes, wherein the 
mapping is performed using at least one mapping tool; 

creatin ;; metadata information, including identification of the predetermined class, for the 
documents; an:l 

cataloguing each of the documents in an integrated library according to the metadata in a 
meta-index, w terein the metadata for each of the documents indexed within the meta-index is 
stored in a thir :1 format and further wherein the meta-index retains the first format and the second 
format, respec tively, for the documents in each of the at least a first and second websites such 
that a user ma; » access one or more of the documents within the at least a first and second 
website utilizt tg the meta-index. 

47. (Previousl;, presented) The method of claim 46, wherein scanning the at least a first and 
second websit : to collect keywords is performed by a spider. 
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48. (Previous!] presented) The method of claim 46, wherein the metadata is stored io the 
extensible Markup Language (XML) format. 

49. (Previously presented) The method of claim 46, wherein the metadata is stored in the 
Resource Description Framework (RDF) format. 
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